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Abstract 

We consider a MIMO fading broadcast channel where the fading channel coefficients are constant 
over time-frequency blocks that span a coherent time x a coherence bandwidth. In closed-loop systems, 
channel state information at transmitter (CSIT) is acquired by the downlink training sent by the base 
station and an explicit feedback from each user terminal. In open-loop systems, CSIT is obtained by 
exploiting uplink training and channel reciprocity. We use a tight closed-form lower bound on the ergodic 
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0^ ' achievable rate in the presence of CSIT errors in order to optimize the overall system throughput, by 



taking explicitly into account the overhead due to channel estimation and channel state feedback. Based 
on three time-frequency block models inspired by actual systems, we provide some useful guidelines for 



On 

the overall system optimization. In particular, digital (quantized) feedback is found to offer a substantial 
advantage over analog (unquantized) feedback. 

Keywords: MIMO broadcast channel, Multiuser MIMO Downlink, Channel State Information 
Feedback, Channel Estimation. 

I. Introduction 

The downlink of a wireless system with one Base Station (BS) with N t antennas and K User 
Terminals (UTs) with a single antenna each is modeled by a MIMO Gaussian broadcast channel 
[1], defined by 

y k [i] =hfx[i] + z fc [i], k = l,...,K (1) 

for % = 1, . . . , T, where yj.[i) is the channel output at UT k, Zk[i] ~ CN(0, No) is the corre- 
sponding Additive White Gaussian Noise (AWGN) process, h k £ C^' is the vector of channel 
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coefficients from the BS antenna array to the A;-th UT antenna and x[i] is the vector of channel 
input symbols transmitted by the BS, subject to the average power constraint E[|x[i]| 2 ] < P 
(enforced for each channel use i). We denote the downlink signal-to-noise ratio (SNR) at each 
UT by p = j^. We assume a block fading model where the channel vectors {h k } remain constant 
over a coherence block of T channel uses. The block length T is related to two physical channel 
parameters, the coherence time T c and the coherence bandwidth W c by T = W C T C . For example, 
taking as typical values W c = 500 kHz and T c = 2.5 ms (from [2]), we obtain T = 1250 channel 
uses. 

Albeit suboptimal, zero-forcing (ZF) beamforming with K — N t users captures the funda- 
mental trend in terms of degrees of freedom (or "multiplexing gain") [2]. Therefore, we focus 
on this case for its analytical tractability. In order to perform ZF beamforming (or any other 
multiuser MIMO precoding), the BS must have an accurate estimate of the downlink channel. 
Such information, referred to as the Channel State Information at the Transmitter (CSIT) is 
acquired by using downlink training and channel state feedback. On the one hand, in TDD 
systems with self-calibrating devices, owing to the fact that uplink and downlink take place in the 
same channel coherence bandwidth, CSIT can be acquired directly from the uplink pilot symbols. 
On the other hand, the uplink-downlink channel reciprocity does not hold in Frequency-Division 
Duplexing (FDD) systems where uplink and downlink take places in different widely separated 
frequency bands. This is also the case in Time-Division Duplexing (TDD) systems where uplink 
and downlink may time-share the same band but the non-linear devices are not self-calibrated 
and therefore induce non-reciprocal effects. In the latter case, an explicit CSIT feedback must 
be used. In any case, the rates achievable with ZF beamforming depend critically on the quality 
of the CSIT, however, high quality CSIT can be achieved by dedicating a significant amount 
of time resource to downlink training and (for FDD) to channel state feedback. It follows that 
there is a non-trivial tradeoff between the benefits of improving the CSIT and the overhead in 
channel estimation and feedback. 

In this work, we determine the optimum fraction of resources that should be dedicated to 
training/feedback in several cases of interest. In particular, we consider three time-frequency 
block models depicted in Fig. [TJ These models can be viewed as an idealization of the actual 
systems such as LTE [3] and aim at capturing the essential features. In SectionUm we consider the 
optimization of the net spectral efficiency based on model 1 where both training and feedback 
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consume "downlink" channel uses. This analysis applies naturally to TDD with or without 
reciprocity and FDD where downlink training and (uplink) feedback are performed in the same 
fading coherence block, via some hand-shaking protocol. In Section UVj we consider a different 
viewpoint based on models 2 and 3, in which the CSIT feedback consumes "uplink" channel 
uses. These models are more relevant to FDD systems. The question that we address is "how 
much uplink resource should one pay in order to achieve a certain downlink spectral efficiency?". 
By solving the corresponding optimization problem, we characterize the uplink/downlink spectral 
efficiency region. At which point of this tradeoff region the system should operate is a function 
of the specific system requirements such as uplink/downlink traffic demands. For a fixed demand, 
the optimal operation point can be found adaptively. Further, we study the effect of temporally 
correlated fading channels and feedback delay where CSIT is obtained through a one-step 
prediction model (model 3). This corresponds to the case when the downlink block bandwidth 
Wf and the block length Tj are significantly shorter than the coherence bandwidth W c and the 
coherence interval T c , respectively. Finally, Section |V] presents some considerations for the case 
of K > N t users with some downlink scheduling and user selection [4]. This case is very relevant 
in practice, but its analysis has escaped so far a full closed-form characterization. Therefore, we 
provide results by combining Monte Carlo simulation and closed-form analysis. 

The optimization of training has been studied in the context of point-to-point MIMO channels 
in the literature, e.g., [5], [6], [7], [8], [9]. In [5], the point-to-point MIMO communication 
is considered and only downlink training is addressed for the case of no CSIT and imperfect 
Channel State Information at the Receiver (CSIR). On the other hand, in [7], perfect CSIR is 
assumed and the resources to be used for channel feedback are investigated. In [8], [9] the model 
of [5] is extended to also incorporate quantized channel feedback and transmitter beamforming. 
Although the setup is quite similar to ours, the emphasis of [8], [9] on the asymptotic regime, 
where the number of antennas and T are simultaneously taken to infinity, leads to rather different 
conclusions as compared to the present work. In [6], a MIMO broadcast with downlink training 
and perfect channel feedback (i.e., the BS is also able to view the received training symbols) is 
considered. It is shown that the sum rate achievable with a dirty paper coding-based strategy has 
a very similar form to the achievable rate expressions in [5], and thus many of the conclusions 
from [5] directly carry over. On the other hand, we consider the more practical case where there is 
imperfect feedback from each UT to the BS and also study achievable rates with ZF beamforming, 
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which has lower complexity than dirty-paper coding. The present work is an extension of [10], 
[11], where the same optimization was investigated assuming that both downlink training and 
uplink feedback are performed within the same block (model 1). In this paper, we provide more 
complete guidelines on the overall system optimization for the various scenarios of interest. 



When the multiuser MIMO downlink is operated in a closed-loop mode, the CSIT is obtained 
through the following phases: 

1) Common downlink training: T tr shared pilot symbols (i.e., ^ pilots per BS antenna) are 
transmitted on each channel coherence block to allow all UTs to estimate their downlink channel 
vectors {h k } based on the observation 



Using linear MMSE estimation, the per-coefficient estimation error variance is given by 



2) Channel feedback: Each UT feeds back its channel estimation immediately after the training 
phase. We focus on the scenario where the feedback channel is modeled as an AWGN channel 
with the SNR p, identical to the nominal downlink SNR. Because UT's are assumed to access 
the feedback channel orthogonally, a total of channel symbols translates into ^ feedback 
channel uses per UT. Different feedback strategies are described in Section HITl 

The BS obtains the channel state matrix H = [hi, . . . , h Nt ] based on the training/feedback 
information. Errors in the CSIT available to the BS stems from two sources: the channel 
estimation error during the common training phase, and the distortion incurred during the 
feedback phase. Then, the BS computes the ZF beamforming vector v fc to be a unit-norm vector 
orthogonal to the subspace = span{h^ : j ^ k} for all k. In this case, the ergodic rate 
achievable by UT k with equal-power allocation across UT's and Gaussian random coding is 
given by: 



II. Channel State Estimation and Feedback 




(2) 



1 



(3) 





(4) 
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assuming each UT is aware of its received signal-to-interference plus noise ratio (SINR)Q The 
residual interference due to non-zero "leakage" coefficients {|h£vj|} decreases the achievable 
rate. In [12], it is shown that the rate in (|4]) is tightly lower-bounded by 



R k > Rf - AR k (5) 



where i?f F is the rate achievable with perfect CSIT and AR k denotes the rate gap, given in 
closed form by 

Ai? fe = logfl + -f E E O h ^| 2 ]V (6) 



Assuming that the channel statistics are symmetric over users and space, R k , Rf F and AR k 
do not depend on k, therefore the subscript k will be omitted in the following. The rate gap 
depends on T tr , T^ and the training/feedback strategy and will be generally denoted by the 



function AR(T tI1 Tfb). Explicit expressions are found in [12] for the cases addressed in this 
paper. 

III. Joint Optimization of Training and Feedback 

In this section, we focus on model 1 of Fig. [TJ where training and CSIT feedback consume 
downlink channel uses. Model 1 (a) refers to the TDD system exploiting the channel reciprocity, 
while model 1 (b) refers to either the TDD without reciprocity or the FDD system in which the 
downlink training and the feedback are performed in the same fading coherence block. In both 
cases, the maximization of the net downlink spectral efficiency is formulated as 



max 

Ttr.TffalTtr+Tfb^T 



(l - (R ZF - ~AR(T tT , Tfb)) . (7) 



It is convenient to consider the maximization in two steps, by writing: 



max max 

T t <T Ttr+T^Tt 



(i - (R ZF - Ai?(T tr , Ik)) . (8) 



Furthermore, the rate gap can be put in the general form (see [12]) 



AR(T tr , Tfb) = log (1 + <7(T tr , Tfb)) (9) 

'Such knowledge can be acquired through an additional dedicated training round as discussed in [12]. This training round 
does not significantly affect the present work, and thus is ignored for the sake of simplicity. 
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where the function g(-,-) depends on the feedback strategy and shall be specified later. Because 
the first multiplicative term is constant when T tr + Tfb = T t , the inner maximization corresponds 
to minimization of the function g(-, •), subject to the constraint T tr + Tfb < T t . Letting g(T t ) = 
min Ttr+Tfb < Ti (7(T tr ,Tfb) denote the solution of the inner maximization in ([8]), we can solve the 
outer maximization by searching for the optimal value < T t < T. 



A. TDD with channel reciprocity 

When channel reciprocity holds, open-loop CSIT estimation can be obtained from the uplink 
pilot symbols. In this case, the amount of uplink training can be optimized as a special case of 
([8]) where no CSIT feedback is usedj^l In [12, Remark 4.2], the rate gap for a TDD system that 
uses Ttr uplink training symbols is given by: 

, N t -r 



AR = lo^ 

which corresponds to g tdd (T tv ) = with 9 t 
the net spectral efficiency given by 



T 



tr 



(10) 



N t — 1. Plugging this into ©, we maximize 



T 



J-tr 



(ID 



Because /(•) is concave in T tr , the optimal T t * can be found by numerically solving for -^r- = 
where 

df e tr (l - i 



n 1 1 



Ttr 



T 



R 



ZF 



log 1 



(12) 



Although a closed-form solution for T t * cannot be found, we can study the scaling of the optimal 
T t * with the system parameters. It is not difficult to see that the derivative in (fT2~)) is upperbounded 
by i/(T tr ), where 



f(Ttr) 



etr (T - Ttr) 

Tl 



dZF _ ^tr_ 
" ^r 



The concavity of /(•) implies that the solution T t of the equation f(T tT ) 
to the optimal value T*. Solving for f(T tT ) = 0, we find 



K < T 



tr 



OtrT 

R ZF ' 



(13) 

is an upper bound 
(14) 



Note that a similar optimization is considered in [13], although in that work analysis of this optimization is not performed. 
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0tr 
7tr 



Hi 



(which becomes accurate 



Furthermore, when the rate gap is small such that log 
for large T), the upperbound also becomes a very good approximation. 

Two interesting behaviors are obtained from (12TT) : 1) for a fixed SNR (i.e., constant R ZF ) T t * 
increases as 0(VT) as T -> oo; 2) for a fixed block length T, T* decreases as 0(l/\/W F ) for 
large SNR, or equivalently, it decreases as 0(l/-y/log(p)) since i? ZF = log(p) + 0(1) for large 
SNR. 

Next, we examine the impact of T t * on the net achievable rate. By the definition of T t * we 
have: 



f(T*)>f[T t 



9i 



R ZF T 



R ZF - lot 



6trR ZF 



T 



(15) 



The rate gap with respect to R ZF can therefore be upper bounded as: 



R 



ZF 



f(T*)<R ZF -f[T t 



6 tr R ZF 



T 



+ log 1 



e tr R ZF 



T 



'tr 



R ZF T 



log 1 



< 2 



# trJ R ZF 



(16) 
(17) 

(18) 



where the final inequality is reached by dropping the last term in (flTI) and using log(l +x) < x. 
Thus, the gap to a perfect CSIT system decreases roughly as 0(1/ VT) as T increases. 

For a future reference, it is worthwhile to notice that model 1 (a) corresponds to model 1 (b) 
with perfect feedback such that the BS knows the UT channel estimates. As a result, the net rate 
achievable with TDD, channel reciprocity and open-loop CSIT estimation serves as an upper 
bound to the rate achievable with any form of CSIT feedback considered in the following. 



B. Analog Feedback 

An option for the CSIT feedback scheme consists of sending the channel coefficients as 
QAM unquantized modulation symbols. This is usually referred to as "analog feedback" in the 
literature, since the scheme is indeed akin to analog amplitude/phase modulation. Because each 
UT is allowed ^ feedback channel uses, this scheme transmits each channel coefficient over 

iVi 

Sfr feedback channel uses (if > Nf, each coefficient is effectively repeated 2% times on 
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the feedback channel). At the BS receiver, MMSE estimation is used. The resulting rate gap is 
described as [12, Section IV] and results in the g(-, •) function 

^ nal o g(Ttr; Tfb) = Nt_l + N t (Nt-l) _ (19) 

-ttr J-fb 

For the sake of generality, we consider a generalized form of (fl9l) as g analog (T tr , Tfb) = ff 1 + 
for two non-negative weights tr and 9^. Comparing (fl9l ) with (flOl) . we notice that the previous 
TDD open-loop case corresponds to letting 6^ = 0, consistently with the fact that in this case 
no CSIT feedback is used. 

It is immediate to check that the minimization of 5f analog (T tr , Tfb) subject to T tr + Th = T t , 
and to T tr ,Tfb > is a convex problem. The corresponding Lagrangian [14] is given by 

£(T tr , Tfb, n) = g(T tI , Tfb) + -^(T tr + T^) 

A 4 

where > is the Lagrangian multiplier for the equality constraint. The KKT conditions [14] 
yield the solution T t * = v^W 1 and T^ = \/#fb/^ Imposing the equality constraint and eliminating 
/i, we obtain: 

T ^ = V^ T " T ^ = \/¥ T * (20) 

I A | A 

where we let X = (y^ + V^lb) 2 , and the resulting objective value is given by g analo s(T t ) = ^. 

The outer optimization (step 2) is now characterized in terms of a single variable T t and 
reduces to the maximization of CCD) where we replace T tr and # t r by T t and 3C, respectively. As 
a result, we find the optimal scaling for T t as 

~ / TCT 

^<^ = V/^- (21) 

Hence, the same analysis holds for the total length T* of training and feedback. In addition, the 
following upper bound on T t * can be obtained by combining ([2~TT) with (1201) 




22 - V = V ^ = V ""i^- (22) 

According to this upperbound, the optimal downlink training is independent of 9^, and thus of 
the efficiency of the feedback channel. 

Similarly, we obtain the effective rate gap with respect to R ZF as 



9 tv T (N t -1)T 



%R ZF 



i^-/(2J)<2y-^ r - (23) 

Comparing this and the corresponding expression (fT8l) for the open-loop TDD, we see that the 
analog feedback incurs a rate gap increase by a factor 1 + y/W t . 
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C. Error-Free Digital Feedback 

We now analyze a digital feedback technique where each UT quantizes its estimated channel 
vector into a 5-bits message and then maps these bits into 2k transmit symbols. For the 
quantization step we consider an ensemble of random vector quantizers (RVQ) with directional 
quantization as described in [15]. Assuming the feedback messages are received error- free, in 
[12, Section V] it is shown that the rate gap is given by 

/ N - 1 b \ 

AR = log I 1 + — + p 2 *=rj. (24) 

For the time being, we assume unrealistically that error-free communication is possible over 
the feedback channel at a rate equal to its capacity of log 2 (1 + p) bits per channel use. Letting 

B = 2k l g 2 (1 + p), we obtain 

/igital (Ttr) Tfl,) = ^— - + p (1 + p)~W^ . (25) 

Following the two-step approach, we minimize the above function subject to T tr + Tr, = T t . 
Since g dl s ital (- ; •) is convex in T tr , Tr,, we form the Lagrangian and readily obtain 

T tr = pv 7 ^ 3 ! Tfl, = JV t (JV t - 1 ^ V ( V (26) 

m(l + p) 

where // > is chosen so that the equality constraint is fulfilled. Note that Tfb grows as O (In /x), 
much slower than the linear increase (in jx) for T tr . 

Contrary to the earlier analog feedback case, we cannot express g dlgltal (T t ) in a simple closed 
form. However, using (1261) we can eliminate p, and express Tfb as a function of T tr : 



2]n(T tr ) + ln( 5 $g±g I 



^•w- 1 ) ln(1+ V ■ (27) 



and thus the net spectral efficiency can be written as: 

/ 2 1n(T tr ) + lnf P^ + P) \ \ 



_ ln( v 1+ ;; 



i? ZF - lot 



T ta (T tr ) 2 ln(l+p) ) 



X 

V / 

Because Tfb increases logarithmically in T tr , and decreases with the SNR p, its effect on the 
maximization is rather negligible. As a result, the maximization of T tr is very similar to the 
case of TDD with channel reciprocity. In other words, the error-free digital feedback performs 
almost as good as the TDD open-loop upper bound. 
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D. Digital Feedback with Errors 

We consider a practical digital feedback scheme with a very low complexity. In particular, we 
assume that the B feedback bits are transmitted on the uplink by using uncoded QAM. Each 
UT makes use of ^ feedback channel uses for its CSIT feedback. Assuming that quantization 
bits are arbitrarily mapped to the QAM constellation symbols, the error of any symbol renders 
the feedback from a particular UT effectively useless and thus leads to a zero rateO Under this 
assumption, the achievable net spectral efficiency is given as a solution to 



max 

Tt^TfbiTtr+Tfb^T 



(i - ^y^) (i " Pefi,) [RT - 2S] (28) 



where A/2 is defined in (124)) and where P e ,fb is the feedback message error probability. The size 

BNt 

of the QAM constellation is given by M = 2 T ft> and yields a symbol error probability [16] 




and a corresponding feedback message error probability 

P e|fb = l-(1-P s pfc. (30) 

Following the two-step optimization approach, we rewrite the outer optimization as 



max 1 — — 



R ZF - AR(T t ) 



(3D 



T t <T y t 

where the effective rate-loss AR(T t ), incorporating the loss due to erroneous feedback, is given 
by 

AR(T t ) = min ((1 - P e , ft ) log ( 1 + ^— - + P M~^W^ ) + P^f } . (32) 

Ttr +T lh =T t L \ l tr J ) 

If the QAM constellation size is suitably optimized, the probability of feedback error can be 
made sufficiently small when the number of feedback bits per user is large. For example, 
for N t = 4 at 10 dB with B = 25 bits and 4-QAM, we have P e)fb = 0.0194. As a result, 
the minimization in (1321) is very similar to the minimization of g dlgltal (T tr , Tfb) for error-free 
feedback in (1251) . 

We conclude this section by providing some numerical examples to compare the performance 
of different feedback strategies. In Fig.|2]the optimal values of T tr and are plotted versus block 

3 This point can be made rigorous, but we limit ourselves to the present intuitive argument for the sake of space limitation. 
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length T for analog feedback, error-free digital feedback, and QAM-based digital feedback along 
with the uplink training length T tr for the TDD system. Most striking is the fact that the optimal 
values of T tr are essentially identical for the three feedback techniques as well as for TDD. 
Furthermore, although not shown here, the optimal values of T tr are very well approximated 
by \J ^jpP^ as in (l22l) . The number of feedback symbols, however, depends critically on the 
feedback method. Because analog feedback is so inefficient, a large number of feedback symbols 
are used so that the rate gap due to feedback is minimized. On the other hand, digital feedback 
is very efficient and a relatively small number of feedback symbols is required. 

In Fig. [3l the sum spectral efficiency is plotted versus block length T. Although not shown 
here, the rate approximations based upon (fl"8l) are seen to become increasingly accurate as T 
increases for analog and TDD. Analog feedback is outperformed by digital feedback with or 
without errors, for any T. This is because digital feedback offers a significantly smaller distortion 
as compared to analog whenever Tr, is larger than (approximately) Nf (i.e., one symbol per 
channel coefficient) [12, Section VI], and for reasonable block lengths it is optimal to use 
considerably larger than (see Fig. O. 

IV. Separate Uplink and Downlink Bandwidths 

In FDD systems, the uplink and downlink bandwidths are generally separated and the amount 
of channel uses per block length dedicated to the CSIT feedback impacts the uplink spectral 
efficiency as an overhead, rather than the downlink as in the previous section. In this section 
we focus on models 2 and 3 of Fig. \T\ assuming that the downlink and uplink bandwidths are a 
priori fixed. The challenge here consists of determining the tradeoff region of downlink spectral 
efficiency versus uplink CSIT feedback overhead. 

For this purpose, we consider the net downlink spectral efficiency, accounting for the training 
overhead, as a function of T&. For each value of T^, the optimal number of downlink training 
symbols is found, and the corresponding net downlink spectral efficiency is given by: 

w (r ft ) 4 max (l - ^) (V - log (l + + A(r ft )) ) (33) 

where A(Tfb) denotes the loss term due to CSIT feedback. By solving for the maximization 
with respect to T tr , we obtain a tight lower bound on the optimal downlink spectral efficiency 
achievable with ZF beamforming as a function of the parameter Tfb, that quantifies the number 
of channel uses per block spent for the CSIT feedback over the uplink. 
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In the following, we first characterize such a tradeoff for the cases of the AWGN feedback 
channel based on model 2. Then, we address the case of a temporally correlated channel with 
feedback delay and channel prediction by considering model 3. 

A. AWGN feedback link 

For the orthogonal access over the AWGN feedback channel, we have A(Tfb) = Nt{ %T 1] for 



T, 



fb 



analog feedback, or A(Tfb) = p(l + p)~ N t( N t-v for error-free digital feedback (see (fl9l) and 
<l25l)). As seen previously, the effect of feedback errors can be made sufficiently small even by 
very simple schemes based on uncoded QAM modulation. Hence, due to the space limitation, 
we provide only the analysis for the case of error-free digital feedback operating at the uplink 
AWGN capacity, which captures the essential behavior of digital feedback while allowing for 
much simpler analytical expressions. Nevertheless, in the numerical results we provide also the 
results for a 4QAM-based digital feedback for the sake of comparison. 
By simple manipulation, the objective function can be rewritten as: 

1 - T f) ( rZF - "* " + A ™> - "* (> + rokyy)) ■ (34) 

Hence, the optimization has the same form as in Section IIII-AL with R ZF replaced by R ZF — 
log (1 + A(Tfb)) and N t — 1 replaced by y^^y- ^ f 01 l° ws mat we can immediately write the 
bound on the optimal training length as 



TZ(T & ) < T^T*) - \l ( ^z F _ log(1 +\ T ^ )(1 + A(Tft)) - ( 35 ) 
Although T t *(Tfb) does depend on Tfb, this dependency is very weak whenever T* is not too 



small. Thus, very little is lost by simply choosing T tr = y T . 

Using the same arguments as in Section IIII-Al the downlink spectral efficiency can be lower 
bounded by 




w (T*) > ( 1 ~ J^r 1 ( R ZF - log f 1 + \l m{N : l > + A^) 1 1 (36) 



3 zf ,jR ZF (N t -l) [ 1 -\JtW? 



N t -1 



>R L * -2\ ^ '-- / : A^). (37) 

T 
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Using the expressions for A(Tfb) we have: 



\z^M \mzn (38) 



i? ZF (iVi - 1) / 1 



/ R ZF (Nt- 


1) 


T 




i 


V rri"' 




/R ZF (N t - 


-1) 



^digital (Tfb) > ^ZF _ 2^/ - - | ] p (1 + p)-^FTT . (39) 

Notice that the spectral efficiency penalties due to training and feedback are separable in these 
lower bounds. Based upon these expressions, we expect that the downlink spectral efficiency 
u> dlgltal (Tfb) with digital feedback converges very quickly to the rate accounting for the optimized 
training overhead, which is approximately R ZF — R ^ t —^-, whereas convergence is much 
slower with analog feedback. 

The above definitions of w analog and u/ dlgltal characterize the net downlink spectral efficiency 
as a function of the number of uplink symbols per block length used for CSIT feedback. In 
terms of system design, it is more meaningful to characterize the downlink rate as a function 
of the uplink bandwidth used for channel feedback. Under the block-fading model adopted in 
this paper, the channel is constant for T c seconds over the bandwidth of W c . Since uplink 
symbols are used for channel feedback for every block, the uplink bandwidth used for channel 
feedback is given by ^ Hz, and the downlink rate is given by W c w(T^) in bit/sec (bps). 

We can take advantage of the above analysis to understand the fundamental tradeoff between 
downlink and uplink rate. To this end, we employ a simplistic model of the uplink in which we 
assume the uplink bandwidth of W up Hz and the uplink spectral efficiency of C up bps/Hz. Since 
feedback consumes jr- Hz of uplink bandwidth, the remaining bandwidth of W up — jr- Hz is 
available for uplink data transmission. Thus the uplink data rate is 

i2u P (Tfb) = (V up - C UP - (40) 

while the downlink rate is 

R down (T ib ) = W c w(T {b ). (41) 

As Tfb increases, the downlink rate -Rdown increases at the expense of decreasing uplink rate 
R up . In order to determine the operating point on the (-Rdown, R up ) Pareto-optimal boundary, a 
common method consists of maximizing the weighted sum of rates: 

max Ai? down (rfb) + Ai? up (Tfb) (42) 
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where < A < 1 and A = 1 — A. This optimization is equivalent to 

max XWcWiT^-xf^jC^. (43) 

Tib V 1 c J 

After multiplying both sides by T c and taking the derivative with respect to T^, we see that the 
optimal solution satisfies: 

XTw'(T fh ) = XC up - w' (T & ) = —C^. (44) 

More precisely, we obtain the optimal Tfb as a function of A as 



7S-(A) = . n {Nt - 1)TA (45) 



Cup A 



T a igital(A) = AW-1) /rpl 0g (l + p) TAA 
03 ^ ' log(l + p) 8 U(JV ( -WJ 



l-y^ 

where we let r = / r zf R n l : - Clearly the feedback length is non-negative and upper bounded 

by T. Compared to analog feedback, the feedback length T^ lgltal (A) with digital feedback is 
almost insensitive to A except the corner points (A = 0, 1). By plugging the above expressions 
into (1381) . (l39l) . the achievable rate can be parameterized by A such that 



m »,„ E(A) > flZF _ 2 ^ Zr W-D _ JrJWj^X (4?) 



.digital / \ \ \ D ZF r. 

/_RZF(JV, - 1) JV,(JV, - l)C up A 



^(X)>R*-2,I^^- f A - g(1 -y . (48) 

The third term, representing the rate loss due to the imperfect feedback, is rather marginal both 
for analog and digital feedback schemes for a large T in the range < A < 1. From these 
expressions, it can be expected that the tradeoff curve with digital feedback is sharper and 
dominates the curve with analog feedback. 

To make this discussion more concrete, consider a single resource block in LTE, with band- 
width 200 kHz and duration 1 ms, corresponding to T = 200 in our model. We assume 
C up = 1.512 bps/Hz (per user) and an uplink bandwidth also equal to 200 kHz, for the sake of 
symmetry. The uplink-downlink sum rate boundary (expressed in kbps) and the corresponding 
feedback lengths are shown in Figs. 0] and [5] A well-designed system will typically operate near 
the sharp "knee" of the curves of Fig. HI where the downlink rate is very close to its maximum 
value. Fortunately, because of the relatively low cost of channel feedback, the uplink rate is also 
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reasonably close to its maximum. From Fig. [5] we remark also that analog feedback requires 
a longer Tfb for a larger weight A while the feedback length with digital feedback is almost 
constant. The training length was found to be 24 symbols for any scheme except for A « 0. 
Note that the choice T = 200 is quite conservative. As argued in Section HI typical physical 
channel parameters yield a significantly larger T for low mobility users. 

The takeaway message of section is that, unless uplink data rate is very strongly preferred over 
downlink data rate, it is efficient to operate the system at a point where the downlink spectral 
efficiency is very close to the perfect-feedback case. 

B. Delayed feedback channel 

In this section we study the uplink/downlink tradeoff by taking into account the effect of the 
feedback delay and the temporally correlated channel based on model 3. This model is motivated 
by the following scenario. In practice, the downlink resource allocation blocks, i.e. the block 
bandwidth Wf and block length Tf, might be defined a priori independently of W c and T c , while 
these coherence parameters depend on the propagation environment as well as the users mobility 
and may even vary from user to user. For the case of a fixed block length T = WfTf much 
shorter than W C T C , the channel coefficients in subsequent blocks are correlated. 

In order to model such situation, we assume that the channel fading coefficients are constant 
within each block of T symbols and changes from block to block according to a stationary 
Gaussian random process with power spectral density (Doppler spectrum) Sh{£), strictly band- 
limited in [— F, F], where F < 1/2 is the maximum normalized Doppler frequency shift, given 
by F = ^T f , where v is the mobile terminal speed (m/s), f c is the carrier frequency (Hz), c is 
the light speed (m/s). Furthermore, such a "Doppler process" satisfies Jj F i? logS'/ l (£)<i£ > — oo. 
This condition holds for most (if not all) channel models usually adopted in the wireless mobile 
communication literature (see [17] and references therein), where the Doppler spectrum has no 
spectral nulls within the support [— F, F]. Because of symmetry and spatial independence, we 
can neglect the antenna index and consider scalar rather than vector processes. 

Contrary to the block-by-block estimation previously considered, each UT k estimates h k (t) 
based on the observation {s k (t — t) : r = d, d+1, . . .,00} available at UT k up to block t — d 
where d denotes the feedback delay in blocks of length WfTf and s k (t) = \J^-jf-h k (t) + z k (t) 
is the received signal at UT k at block t. We focus on the case of d = (filtering) and d = 1 
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(prediction) in the following. The equivalent model for both cases is given by 

h k {t) = h k {t)+n k {t) (49) 

where h k {t) = K[h k (t)\{s k (t — r)}] denotes the estimated channel, independent of the estimation 
error n k (t) ~ G>J"(0, of r ). The one-step prediction MMSE (d = 1) is given by [18], [12] 

€l (S) = S 1 '^ e^p (j\g(S + S h (^)d^j - S (50) 

where we assume a unit-power process, f^ F Sh(£,)d£, = 1, observed in background white noise 
with per-component variance 5 = The filtering MMSE (d = 0) is related to ei(<5) through 
the well-known maximal ratio combining formula 

eo(5) = JTWY ( } 

Since h k (t) and n k (t) are independent, we have E[|/i fc (t)| 2 ] = 1 — <7 t 2 r for any k. 
In [12, Section VI. B], it is shown that the rate gap is upper bounded by 

AR d < log (l + + A(T ft )) . (52) 

For simplicity, we focus on the case of a uniform Doppler spectrum Sh(£) = for — F <i<F. 
This yields 

where the last inequality can be easily shown. Using (|5TT) and (1531) we obtain 

< ! (54) 

5 ~ 1 + (2F5) 2F K ' 

Plugging these expressions into (|52|) . we obtain the rate gap upper bounds as 



rd=0 



AR = log 



l + A(T fb ) + 



iV*-l 



tr 



( 2FN t \ 



^logll + A^ + ^-^K 55 ) 



tr 



Afl" = logll + A^ + ^-^j |. (56) 

We observe that that for the case of filtering (d = 0), the rate gap upper bound reduces to that 
of the AWGN feedback link for sufficiently large p. In what follows, we consider the more 
interesting case of one-step prediction. 
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We can again maximize the net downlink achievable spectral efficiency for the one-step 
prediction case by solving 



w 



(r ft ) ^ max (l - ^ [i? ZF (P) - log (l + «T t f + A(T ft ))] (57) 



where we defined the constant k 



2FN t 



2F 



By letting the RHS of d57]) denote 



/(r tr ,Tfb), we remark that the objective function /(•, •) is concave in T tT . The the optimal T tl 
in (1571) satisfies 



K(l-F)(T-T t 



trJ 



^ ZF - log(l + A(Tfb)) - log I 1 + 



2-F i 



l-F 



K,T t 



-(l-F) 



tr 



(58) 



ACTf,,)) l + A(T ftj 

Following the same arguments as before, it follows that the solution T tr to the equation f(T tT ) = 
is an upper bound to the optimal T t *, where 



k(T - T tr ) 



tr) 



T£~ (1 + A(Tfb)) 



Explicitly, we find 

T t *CTfb) < T tr (T ft 



i?^ - log(l + A(Tfl,)) 



(iV t - 1)T 



-(l-F) 



A(T f 



fb 



(59) 



2F 
2-F 



(60) 



1 + A(T fb )){i?zF - log(l + ACTfb))}; V2FM/ 
As T increases, the training length T tr scales as OiT" 1 ^) depending on the Doppler frequency 
shift < F < ~. For a fixed T, the training length is increasing in F. When the fading is 
quasi-static (i.e., very low mobility users with v ~ 0) such that the channel becomes perfectly 
predictable, the training length coincides with the expression (1351) for the block-by-block esti- 
mation. Since the term A(Tfb) is negligible for a sufficiently large Tft,, we can choose with little 
loss of optimality 



tr 



KT 



(N t - 1)T\^ 



P 



2F 
2-F 



(61) 



r zf J y r zf j \2FN t 

Following in the footsteps of what has been done before, we can obtain the lower bound of the 
downlink spectral efficiency as 



w(Ti 



> 



(l - y-) [R ZF - log (1 + ^T t f - 1 + A(Tfb))] 



> R 



ZF 



t rZF + w - » (mr, 



2F 



->2F-1 



- ^)A(T f 



fbJ 



1 + (iV t - 1) 



2F 

_ \ rp2F 



2FN t 



tr 



(62) 
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where we can replace T tr by (loTI) . Solving the weighted sum rate maximization, we obtain 

the optimal Tfb in the same form of (1451) and (l46l) . for analog feedback and error-free digital 

l_2k 

feedback, respectively, where the term r is now replaced by ; — T 2F . 

i+W-i)(3i%) ^ 

In order to quantify the impact of the delay on the uplink-downlink tradeoff, Fig. [6] shows 
the uplink-downlink sum rate Pareto boundary for different mobile speeds v = 6, 50, 80 km/h 
yielding the Doppler shift of F — 0.011, 0.093, 0.148, respectively, with the same parameters as 
Fig. HI The corresponding feedback length as a function of A is shown in Fig. [71 where we only 
plotted for v = 6, 80 km/h for the sake of clarity. We recall that A = 1 corresponds to the corner 
point (-Rdowm 0) while A = corresponds to the other corner point (0,R up ). As expected from 
(loTI) . the training length increases for a higher mobile speed and is found to be 25, 36, 43 symbols 
for v = 6, 50, 80 km/h, respectively. On the contrary, the feedback length is rather indifferent to 
the mobile speed v, although it tends to decrease for a larger v. On the uplink-downlink tradeoff 
curve, the higher mobile speed decreases significantly the downlink rate since the larger training 
length incurs a significant rate loss. 

Fig. [8] shows the achievable downlink sum rate in kbps versus the mobile speed v km/h when 
the uplink feedback length is set to Tfb = 30 over a block length of T = 200 symbols. We 
compare analog feedback, error-free digital feedback as well as 4QAM-based digital feedback. 
It is observed that by dedicating 15% of the uplink resource to the feedback, the uncoded 4QAM 
outperforms the analog feedback. 

V. Allowing for Many Users 

We conclude this paper by providing a discussion on the relevant case of K > N t . Until now 
we have assumed that the number of users is fixed equal to the number of BS antennas N t . 
In a real system there are often more than N t users (with data awaiting at the BS). If more 
users feedback their channel information, the BS can utilize user selection and generally obtain 
a non-negligible increase in downlink spectral efficiency. Of course, allowing additional users 
to feed back will incur a larger uplink bandwidth cost. Indeed, a well designed system should 
optimize not only the total number of feedback symbols used on the uplink, but also the number 
of users who feed back their channel state. When the number of users enters into the picture, 
we see that the uplink-downlink tradeoff, which appeared rather trivial for a fixed number of 
users, becomes indeed interesting and non-trivial. 
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Although the lower bound of [12] does not hold when user selection is performed, it can 
be numerically verified that it is nonetheless a reasonable approximation of the rate with user 
selection and imperfect CSIT. For the sake of the space limitation, we focus on the separate 
uplink/downlink bands (model 2) although the other models can be adapted to the case of K > N t 
in a same manner. The corresponding downlink spectral efficiency is the solution to 

w (Tfe, K) ^ T max <r (l - ^ Uf - log (l + + A{T & )\ ) (63) 

where now Rjf denotes the perfect CSIT rate with ZF beamforming and user selection [19], 
[20] and K users. This is computed via Monte Carlo simulation due to the lack of an analytical 
expression. Since the T^ feedback symbols are now split between K users, we now have 
A(Tfb) = p (1 + p) K ( N t- 1 ) for the case of error-free digital feedback. 

In Fig. HI the downlink sum spectral efficiency w (Tfb, K) is plotted versus Tfb for K = 
4, . . . , 8. The spectral efficiency is maximized by letting K = 4, 5, 6, 7, 8 users feedback for 
T ft < 24, 25 < T ft < 29, 30 < T^ < 36, 37 < < 41, T ft > 42, respectively. Thus, the 
sum spectral efficiency is maximized by having approximately users feedback; this is very 
consistent with the findings of [21]. If the number of users is fixed to K = 4 there is virtually no 
benefit in increasing Tfb beyond 35 or 40 because at that point the feedback channel is essentially 
perfect. However, a larger Tfb enables more users to feed back and yields a non-negligible gain 
in the achievable rate. For Tb < 200, it turns out that no more than 31 users are needed. In 
Fig. [10] the same plot is given for K = 4, ... ,31, for ideal digital and QAM feedback. As 
Tfb increases the marginal benefit of feedback (i.e., the slope) decreases, but adding users does 
provide a reasonable benefit even up to the 31-st user. 

We can also consider the tradeoff between uplink and downlink rate as done before. Plotted 
in Fig. [TTJ are the uplink and downlink sum rates, using precisely the same parameters as Fig. 
|4] (i.e., T c = 1 msec and W c = 200 kHz). We now see a non-trivial tradeoff for downlink rates 
larger than 1750 kbps (as before, it does not make sense to choose a smaller downlink rate 
than this unless uplink data rate is much more strongly preferred than downlink data rate). If 
uplink and downlink data rates are equally weighted, the optimal operating point corresponds to 
(approximately) R up = 828 Kbps and -Rdown = 1966 Kbps, which is achieved with K = 11 and 
Tfb = 63 symbols. Note that the substantial benefit of allowing more users to feed back means 
that roughly 30 % of the uplink bandwidth is used for channel feedback. 
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Fig. 1. Different time-frequency block models. 
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Fig. 2. Feedback/training length vs. block length for Nt — 4, p = 10 dB. 
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Fig. 3. Sum spectral efficiency vs. block length T. 
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Fig. 4. Downlink vs. uplink tradeoff for Nt = 4, p = 10 dB, T = 200 symbols. 
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Fig. 5. Feedback length vs. A for N t = 4, p = 10 dB, T = 200 symbols. 
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Fig. 6. Downlink vs. uplink tradeoff over the delayed feedback for Nt = 4, p — 10 dB, T = 200 symbols. 




Fig. 7. Feedback length vs. A for N t = 4, p = 10 dB, T = 200 symbols. 
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Fig. 8. Downlink rate vs. mobile speed for Tn, — 30, T = 200. 




Fig. 9. Downlink sum spectral efficiency vs. feedback symbols (Tfb) for T — 500, Nt = 4, p = 10 dB, for K from 4 to 8 
users. 
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Fig. 10. Downlink sum spectral efficiency vs. feedback symbols (Tfb) for T = 500, N t = 4, p = 10 dB, for K from 4 to 31 
users. 
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Fig. 11. Downlink vs. uplink tradeoff for Nt = 4, p = 10 dB, T = 200 symbols; allowing for up to 31 users to feed back. 
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